SPCS: a spatial pyramid convolutional shuffle module for YOLO to detect occluded object
نویسندگان
چکیده
Abstract In crowded scenes, one of the most important issues is that heavily overlapped objects are hardly distinguished from each other since their pixels shared and visible occluded objects, which used to represent features, limited. this paper, a spatial pyramid convolutional shuffle (SPCS) module proposed extract refined information limited generate distinguishable representations for objects. We adopt four kernels with different sizes dilation rates at location in features adjacently recombine fused outputs spatially using pixel module. way, instance predictions corresponding can be produced feature. addition, multiple operations kernel same regions, helpful pixels. Extensive experimental results demonstrate SPCS effectively boost performance human detection. YOLO detector achieves 94.11% AP, 41.75% MR, 97.75% Recall on CrowdHuman, 93.04% 98.45% WiderPerson, best compared previous state-of-the-art models.
منابع مشابه
Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers
This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...
متن کاملConvolutional Gating Network for Object Tracking
Object tracking through multiple cameras is a popular research topic in security and surveillance systems especially when human objects are the target. However, occlusion is one of the challenging problems for the tracking process. This paper proposes a multiple-camera-based cooperative tracking method to overcome the occlusion problem. The paper presents a new model for combining convolutiona...
متن کاملConvolutional Neural Pyramid for Image Processing
We propose a principled convolutional neural pyramid (CNP) framework for general low-level vision and image processing tasks. It is based on the essential finding that many applications require large receptive fields for structure understanding. But corresponding neural networks for regression either stack many layers or apply large kernels to achieve it, which is computationally very costly. O...
متن کاملUsing Support Vector Machines, Convolutional Neural Networks and Deep Belief Networks for Partially Occluded Object Recognition
Using Support Vector Machines, Convolutional Neural Networks and Deep Belief Networks for Partially Occluded Object Recognition Joseph Lin Chu Artificial neural networks have been widely used for machine learning tasks such as object recognition. Recent developments have made use of biologically inspired architectures, such as the Convolutional Neural Network, and the Deep Belief Network. A the...
متن کاملSpatial Pyramid Convolutional Neural Network for Social Event Detection in Static Image
Social event detection in a static image is a very challenging problem and it’s very useful for internet of things applications including automatic photo organization, ads recommender system, or image captioning. Several publications show that variety of objects, scene, and people can be very ambiguous for the system to decide the event that occurs in the image. We proposed the spatial pyramid ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Complex & Intelligent Systems
سال: 2022
ISSN: ['2198-6053', '2199-4536']
DOI: https://doi.org/10.1007/s40747-022-00786-7